🤖 reinforcement learning
Scoured 8807 posts in 253.5 ms
On Computation and Reinforcement Learning
arxiv.org · 1d
🧩 operations research
i10e-lab/HelloRL: A fully modular framework to make Reinforcement Learning quick and easy
github.com · 17h · Discuss: Hacker News
🧩 operations research
Distributional Reinforcement Learning with Diffusion Bridge Critics
arxiv.org · 1d
📊 linear programming
Learning Models with Uniform Performance via Distributionally Robust Optimization
dev.to · 1h · Discuss: DEV
📊 linear programming
Distributed Reinforcement Learning for Scalable High-Performance Policy Optimization
towardsdatascience.com · 5d
📊 linear programming
Continual learning and the post monolith AI era
baseten.co · 15h · Discuss: Hacker News
📊 linear programming
Multi-Agent Reinforcement Learning (MARL): Practical Guide to Cooperative and Competitive Learning
dev.to · 1d · Discuss: DEV
📊 linear programming
The control layer for AI
blog.dottxt.ai · 14h · Discuss: Hacker News
🦀 Rust
Writing an LLM from scratch, part 32d -- Interventions: adding attention bias
gilesthomas.com · 14h · Discuss: Hacker News
🧩 operations research
Is Your Machine Learning Pipeline as Efficient as it Could Be?
kdnuggets.com · 1d
📊 linear programming
Mechanistic Interpretability: Peeking Inside an LLM
towardsdatascience.com · 1d
📊 linear programming
A Neuro Symbolic Architecture For Induced Epistemic Agency and System 2 Reasoning in Quantized Large Language Models
papers.ssrn.com · 1d · Discuss: Hacker News
📊 linear programming
Mappa – Fine-tune ANY multi-agent LLM systems end-to-end with AI coaches
news.ycombinator.com · 2d · Discuss: Hacker News
📊 linear programming
Turning Coding Tasks into Feedback Loops
feipeng.substack.com · 2d · Discuss: Substack
🧩 operations research
Knowledge-Creating LLMs
tecunningham.github.io · 6h · Discuss: Hacker News
🧩 operations research
Hypernetworks: Neural Networks for Hierarchical Data
blog.sturdystatistics.com · 1d · Discuss: Hacker News
📊 linear programming
Ottermon.ai – Effortless Observability Deployed in Seconds
ottermon.ai · 1d · Discuss: Hacker News
🧩 operations research
First Proof | Research-Level Math for AI Evaluation
1stproof.org · 1d · Discuss: Hacker News
📊 linear programming
Generative Pen-Trained Transformer
theodore.net · 1d · Discuss: Hacker News
🦀 Rust
Understanding LLM Inference Engines: Inside Nano-vLLM (Part 2)
neutree.ai · 1d · Discuss: Hacker News
🦀 Rust